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Abstract 

For  a  directed  graph  G ,  a  t-identifying  code  is  a  subset  S  C.V (G) 
with  the  property  that  for  each  vertex  v  €  V{G)  the  set  of  vertices 
of  S  reachable  from  v  by  a  directed  path  of  length  at  most  t  is  both 
non-empty  and  unique.  A  graph  is  called  t -identifiable  if  there  exists  a 
t-identifying  code.  This  paper  shows  that  the  de  Bruijn  graph  B(d,n ) 
is  1-  and  2-identifiable  and  examines  conditions  under  which  it  is  not 
t-identifiable.  This  paper  also  proves  that  a  t-identifying  code  for  t- 
identifiable  de  Bruijn  graphs  must  contain  at  least  dn~1(d—  1)  vertices. 
Constructions  are  given  to  show  that  this  lower  bound  is  achievable 
for  1-identifying  codes  when  n  is  odd,  or  n  is  even  and  d  >  2,  and 
for  2-identifying  codes  when  n  >  3.  Further  a  construction  is  given 
proving  that  when  n  is  even  and  d  =  2  there  is  a  1-identifying  code  of 
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size  at  most  one  more  than  the  lower  bound.  Additionally  this  paper 
proves  that  for  B(d ,  n)  the  minimum  size  of  a  directed  resolving  set 
(a  subset  with  the  property  that  every  vertex  of  the  graph  can  be 
distinguished  by  its  directed  distances  to  vertices  of  S)  is  dn~1(d  —  1), 
and  that  if  d  >  n  the  minimum  size  of  a  determining  set  (a  subset  S 
with  the  property  that  the  only  automorphism  that  fixes  S  pointwise 
is  the  trivial  automorphism)  is  |" AA]  ■ 


1  Introduction 

First  introduced  in  1998  [8],  an  identifying  code  for  a  graph  G  is  a  subset 
S  C  V  ( G )  with  the  property  that  for  each  v  G  V (G)  the  subset  of  vertices 
of  S  that  are  adjacent  to  v  is  non-empty  and  unique.  That  is,  each  vertex 
of  the  graph  is  uniquely  identifiable  by  the  non-empty  subset  of  vertices  of 
S  to  which  it  is  adjacent.  Note  that  not  all  graphs  have  an  identifying  code; 
those  that  do  are  called  identifiable.  A  graph  fails  to  be  identifiable  if  and 
only  if  it  contains  a  pair  of  vertices  with  the  same  closed  (in-)neighborhood; 
such  vertices  are  called  twins.  Extending  these  definitions  to  t-identifying 
and  f-twins  is  easy  and  is  covered  in  Section  3.  Identifying  codes  can  be 
quite  useful  in  applications.  For  example,  an  identifying  code  in  a  network 
of  smoke  detectors  allows  us  to  determine  the  exact  location  of  a  fire  given 
only  the  set  of  detectors  that  have  been  triggered.  However,  the  problem  of 
finding  identifying  codes  is  NP-Hard  [5].  The  computational  cost  has  so  far 
limited  the  real-world  use  of  identifying  codes. 

The  directed  de  Bruijn  graph  B(d,n)  is  a  directed  graph  in  which  the 
vertices  are  strings  of  length  n  from  an  alphabet  A  with  d  letters,  and  with 
a  directed  arc  from  vertex  X\X2  ■  ■  ■  xn  to  vertex  x-i  ■  ■  ■  xna  for  each  a  £  A. 
When  looking  for  a  graph  model  for  applications,  it  is  useful  to  choose  a 
graph  with  relatively  few  edges,  but  many  short  paths  between  any  pair  of 
vertices  [  ].  The  de  Bruijn  graphs  have  both  of  these  desirable  properties. 
In  addition,  given  an  arbitrary  pair  of  vertices  in  a  de  Bruijn  graph,  there 
are  routing  algorithms  that,  with  high  probability,  create  a  path  of  length 
O(logn)  between  the  pair  [  ].  The  properties  of  de  Bruijn  graphs  enable 

some  problems  that  are  NP-complete  on  general  graphs,  such  as  the  Hamilton 
cycle  problem,  to  be  computationally  solvable  on  de  Bruijn  graphs  [12].  We 
will  see  that  for  most  de  Bruijn  graphs,  the  construction  of  minimum  size  1- 
and  2-identifying  codes  is  indeed  solvable. 
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Figure  1:  Vertex  Subset  Relationships 


Other  vertex  subsets  that  we  consider  for  de  Bruijn  graphs  in  this  pa¬ 
per  are  dominating  sets,  determining  sets,  and  resolving  sets.  Dominating 
sets  provide  complete  coverage  for  a  graph,  while  resolving  sets  provide  an 
identification  of  vertices  in  graphs  using  relative  distances.  Finally,  deter¬ 
mining  sets  provide  a  set  of  vertices  that  is  only  fixed  pointwise  by  the  trivial 
automorphism.  These  types  of  subsets  are  also  useful  in  applications.  For 
example,  resolving  sets  have  been  used  in  aiding  the  navigation  of  robots 
when  distances  to  sufficient  landmarks  are  known  [9],  and  determining  sets 
are  useful  in  graph  distinguishing  which  can  reduce  graph  symmetry  to  en¬ 
hance  recognition.  These  different  vertex  subsets  are  interrelated,  as  shown 
in  Figure  1.  For  example,  each  resolving  set  and  identifying  code  is  also  a 
determining  set,  but  not  vice-versa.  This  is  discussed  more  fully  in  Section 

4.1  and  Section  4.3. 

In  Section  2,  we  give  careful  definitions  necessary  for  working  with  di¬ 
rected  de  Bruijn  graphs.  In  Section  3,  we  prove  that  for  every  t-identihable 
de  Bruijn  graphs,  any  t-identifying  code  must  contain  at  least  —  1)  ver¬ 

tices.  We  prove  by  construction  that  this  bound  is  achievable  for  1-identifying 
codes  when  n  is  odd,  or  n  is  even  and  d  >  2,  and  for  2-identifying  codes  when 
n  >  3.  For  even  n  with  d  =  2  we  show  the  existence  of  a  1-identifying  code 
of  size  dn^1(d  —  1)  +  1,  which  is  at  most  one  more  than  optimal  size.  The 
existence  of  vertices  with  identical  t-in-balls  in  the  graph  (called  t-twins ) 
guarantees  that  the  graph  has  no  t-identifying  set.  In  Section  3.1  we  address 
the  question  of  when  B(d,n)  contains  t-twins.  In  Section  4  we  study  domi¬ 
nating  sets,  directed  resolving  sets,  and  determining  sets  for  B(d,n).  Section 

4.2  gives  a  proof  that  the  minimum  size  of  a  directed  resolving  set  for  B(d,  n) 
is  dn~1(d  —  1),  and  Section  4.3  that  the  minimum  size  of  a  determining  set 

-  PW 
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2  Definitions 


We  will  be  considering  various  types  of  vertex  subsets  on  the  class  of  directed 
de  Bruijn  graphs.  The  following  definitions  will  be  useful  in  working  with 
this  class  of  graphs. 

Definition  2.1.  Let  Ad  =  {0, 1, . . . ,  d—  1}  and  A7j  set  of  all  strings  of  length 
n  made  up  of  letters  of  A.  When  d  is  clear  from  context  we  will  use  A  and 
A1  respectively. 

— > 

Definition  2.2.  The  directed  de  Bruijn  graph,  denoted  B(d,  n),  has 

vertex  set  A7,,  with  an  edge  from  string  X\X2  ■  ■  ■  xn  to  string  yiy2  . . .  yn  exists 
if  and  only  if  x2x3  ...xn  =  yxy2  ■  ■  ■  yn- 1- 

Definition  2.3.  The  concatenation  of  two  strings  x  =  X\X2 . . .  Xi  and 
y  =  ym  ■■■ yk  is  given  by  x  ©  y  =  xxx2  . . .  Xiyxy2  ...yk. 


Definition  2.4.  The  concatenation  of  sets  of  strings  S  and  T  is  given 
by  S  ®T  =  {x  (By  \  x  e  S  and  y  G  T}. 

Definition  2.5.  The  prefix  of  a  string  x  =  X\X2...xn  is  the  substring 
x\x2  . . .  xn- 1,  denoted  by  x~ . 

Definition  2.6.  The  suffix  of  a  string  x  =  XiX2  . .  ,xn  is  is  the  substring 
x2xs  . . .  xn,  denoted  by  x+. 

Definition  2.7.  When  discussing  substrings  of  a  string  X\X2  . . .  xn ,  we  will 
use  the  notation  x(a  :  b )  to  denote  the  substring  xaxa+i . . .  x^. 

Definition  2.8.  If  a  string  X\X2  . . .  xn  contains  a  constant  substring,  i.e. 
x{l  :  n)  =  0,  then  we  will  denote  the  consecutive  letters  as  the  constant 
raised  to  an  exponent  that  denotes  the  length,  i.e.  x  =  0n.  This  will  also  be 
used  for  repeated  substrings,  such  as  0101 ...  01  =  (01)n/2. 

3  Identifying  Codes 

We  begin  by  building  up  to  the  definition  of  an  identifying  code.  This  requires 
careful  definitions  of  directed  distance  and  f-balls. 
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Definition  3.1.  The  directed  distance  from  vertex  u  to  vertex  v  in  a  graph 
G  is  given  by  d(u,v),  and  is  defined  as  the  length  of  the  shortest  directed 
path  from  u  to  v  in  G. 

Definition  3.2.  Let  v  E  V (G).  The  open  in-neighborhood  of  v  is  given  by 
N~(v )  =  {u  E  V(G )  |  (u,v)  E  E(G )},  and  the  closed  in-neighborhood 
is  given  by  IV- [v]  =  N~(v )  U  {u}. 

Note  that  out-neighborhoods  are  defined  analogously,  but  will  not  be 
needed  in  this  paper. 

Definition  3.3.  The  in-ball  of  radius  t  centered  at  vertex  v  is  the  set: 
B^T(v)  =  {u  E  V(G)  |  d(u,v )  <  t}.  and  the  out-ball  of  radius  t  centered 
at  vertex  v  is  the  set:  B^~ ( v )  ={u£  G(G)  \  d(v,u )  <  t}. 

The  two  following  lemmas  are  useful  in  working  with  distances  in  B(d,  n ) 
and  their  proofs  are  self-evident. 

Lemma  3.4.  In  B(d,  n )  there  is  a  directed  path  of  length  t  <  n  from  x  to  y 
if  and  only  if  x{t  +  1  :  n)  —  y{l  :  n  —  t).  That  is,  if  and  only  if  the  rightmost 
n  —  t  letters  of  x  are  the  same  as  the  leftmost  n  —  t  letters  of  y. 

Lemma  3.5.  In  B(d,n)  if  vertices  x  j -  y  have  the  same  prefix,  then  for  all 
u  7^  {x,y},d(u,x)  =  d(u,y).  In  particular,  B^(x)  \  {a:}  =  \  {y}  for 

all  t  <  n. 

Definition  3.6.  Given  a  subset  S  C  V(G),  the  S  t-identifying  set  for 

vertex  v  is  given  by  IDs(n)  =  B^{y)  D  S. 

Definition  3.7.  A  f-identifying  code  is  set  S  C  V(G)  such  that  each 
vertex  has  a  unique,  non-empty  identifying  set.  That  is,  for  every  u  E  V(G ), 
\Ds{u)  7^  0,  and  for  all  pairs  u,v  E  V(G)  we  have  IDs(u)  7^  IDs(w).  The 
variable  t  is  referred  to  as  the  radius  of  the  identifying  code.  See  Figure  2 
for  an  identifying  code  in  the  graph  B( 2,  3). 

In  the  above  definitions,  if  t  is  omitted  from  the  notation  (i.e.  identifying 
code  instead  of  f-identifying  code),  then  it  is  assumed  that  t  —  1. 

Note  that  not  every  graph  has  a  t-identifying  code  for  each  t.  In  particular 
if  the  graph  has  two  vertices  with  equal  in-balls  of  radius  t,  then  the  graph 
has  no  t-identifying  code.  The  topic  of  such  ‘t-twins’  and  the  resulting  non¬ 
existence  of  t-identifying  codes  is  covered  in  Section  3.1. 


5 


Figure  2:  A  1-identifying  code  in  the  graph  13(2,3)  (black  vertices).  A  2- 
identifying  code  in  this  graph  requires  all  vertices  but  one  of  {000,  111},  and 
there  are  no  f- identifying  codes  for  f  >  3. 

Theorem  3.8.  If  B(d,n)  is  a  t-identifiable  graph,  then  the  size  of  any  t- 
identifying  code  is  at  least  dn~1(d  —  1). 

Proof.  Choose  t  <  n  and  a  ^  h  in  A.  Suppose  that  for  some  w  G  An~l, 
niether  x  =  w  ©  a  nor  y  =  w  ©  h  is  a  set  S.  Since  x  and  y  share  a  prefix, 
by  Lemma  3.5,  Bf  (x)  \  {x}  =  Bf(y )  \  {y}.  Since  neither  x  nor  y  is  in  S, 
IDs(x)  =  IDs(y).  Thus  S  is  not  a  t-identifying  code.  Thus  for  each  w  G  An  ', 
a  f-identifying  code  must  contain,  at  least,  all  but  one  of  w  ©  a  for  a  G  A. 
Thus  a  f-identifying  code  for  B(d,n)  must  have  size  at  least  dn^1(d—  1).  □ 

Note  that  the  result  above  is  independent  of  the  radius  f.  An  interesting 
consequence  of  this  is  the  fact  that  increasing  the  radius  of  our  identifying 
code  does  not  produce  any  decrease  in  the  size  of  a  minimum  identifying 
code.  For  example,  consider  the  potential  application  of  identifying  codes 
in  sensor  networks.  One  might  think  that  by  increasing  the  sensing  power 
(which  corresponds  to  the  radius  of  the  identifying  code)  we  would  be  able  to 
place  fewer  sensors  and  thus  incur  a  savings  overall.  However,  Theorem  3.8 
implies  that  providing  more  powerful  (and  thus  more  expensive)  sensors  does 
not  allow  us  to  place  fewer  sensors.  Thus  we  should  use  sensors  that  have 
sensing  distance  equivalent  to  radius  one.  In  fact,  in  the  case  of  2-identifying 
codes  in  B( 2,3),  we  actually  require  an  extra  vertex  for  a  minimum  size 
of  seven!  A  more  general  result  for  2-identifying  codes  for  n  >  3  is  given  in 
Theorem  3.11.  First,  we  will  look  at  examples  of  optimal  and  almost-optimal 
1-identifying  codes. 

Theorem  3.9.  If  n  is  odd,  or  n  is  even  and  d  >  2,  then 
S  =  And  \  {a  ©  Ay2  ©  a  |  a  G  Ad} 
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is  an  identifying  code  for  B(d,n).  Further  this  identifying  code  has  optimal 
size  ( d  —  1  )dn~1. 

Proof.  Define  S  as  in  the  statement  of  the  theorem.  First,  we  will  see  that  the 
identifying  set  for  every  vertex  has  size  either  d  or  d—  1.  Let  x  =  x\x2  ■  ■  ■  xn, 
then 

N~(x )  n  S  =  {A®XiX2  ■  ■  ■ Xn-i }  \  {Xn-\X\X2  ■  ■  ■  Xn-\}. 

If  x\  =  xn,  then  IDs  (a;)  =  N~(x )  fl  S  has  size  d  —  1.  Whereas,  if  x\  xn, 
then  IDs  (a;)  =  {x}  U  N~(x )  fl  S  has  size  d. 

From  this  it  is  clear  that  every  vertex  has  a  non-empty  identifying  set. 
However  we  must  also  show  that  every  identifying  set  is  unique.  Suppose 
there  are  two  distinct  vertices  x,  y  G  V(B(d,n))  such  that  IDg(x)  =  ID  s{y). 
Call  their  identical  identifying  set  T.  We  look  at  the  two  cases,  \T\  =  d  and 
\T\  =  d  —  1,  separately  below. 

Suppose  that  \T\  =  d.  Then  {x,y}  CT  by  our  assumption  on  T  and  our 
earlier  reasoning.  Since  x  ^  y,  this  means  that  B(d,  n)  contains  both  directed 
arcs  x  — >  y  and  y  — >  x.  This  allows  us  to  conclude  that  {x,  y}  =  {(a6)fc,  ( ba)k } 
for  some  distinct  a,b  £  A  with  k  =  n/2.  In  particular  we  must  have  n  even. 
Below  are  the  precise  identifying  sets  for  x  and  y. 

ID s{(ab)k)  =  {(ab)k,  ( ba)k }  U  {c(ab)k~la  \  c  e  A  \  {a,  b}} 

ID s((ba)k)  =  {( ab)k ,  (6a)fc}  U  {c{ba)k~lb  \  c  e  A  \  {a,  b}} 

If  d  >  2  these  two  identifying  sets  are  in  fact  different,  which  is  a  contra¬ 
diction. 

Suppose  that  \T\  =  d  —  1.  Then  neither  x  nor  y  is  in  T,  which  means 
neither  is  in  S.  However  since  their  identifying  codes  are  identical,  this 
means  that  they  have  identical  first  neighborhoods.  By  definition  of  first 
neighborhoods,  this  means  that  x  and  y  have  the  same  prefix  but  different 
final  letters.  By  then  definition  of  S,  one  of  x,  y  (if  not  both)  is  a  member  of 
S,  which  is  a  contradiction.  □ 

Theorem  3.10.  If  n  is  even,  then 

S'  =  {00(10)fc_1}  U  A‘2  \  {a  ©  A^2  ©  a  \  a  e  A2} 

is  an  identifying  code  for  B(2,  n) .  Further,  this  identifying  code  is  within  one 
of  being  optimal. 
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Proof.  We  showed  in  the  proof  of  Theorem  3.9  that  if  x  7 -  y  have  the  property 
that  Bf(x)  fl  S  —  Bf(y)  D  S  with  n  even  and  d  —  2  then  we  must  have 
{x,y}  =  {( ab)k ,  ( ba)k }  for  a  7^  b  G  A2.  By  adding  00(10)fc_1  to  achieve  S', 
we  are  either  adding  an  in-neighbor  of  x  or  of  y  (but  not  both)  that  was  not 
in  S.  This  yields  identifying  sets  of  different  size  for  x  and  for  y.  Note  that 
either  vertex  from  00(10)fc_1,  ll(01)fc_1  would  work  to  supplement  S  in  this 
case.  □ 

Note  that  while  Theorem  3.10  provides  an  almost  optimal  solution,  1- 
identifying  codes  of  size  2n_1  are  indeed  possible  in  13(2,  n)  with  n  even.  For 
example,  the  following  set  is  a  1-identifying  code  of  size  8  in  B( 2,4): 

{0001, 0010, 0100, 0111, 1000, 1011, 1101, 1110}. 

The  following  constructs  2-identifying  codes  for  all  B(d,n )  with  n  >  3. 

Theorem  3.11.  Let  n  >  3  and  S  =  \  {x3ax3X4 ...  xn-\a  \  a  G  Ad}- 

If  n  is  even,  then  S  is  a  2-identifying  code  for  B(d,  n).  If  n  is  odd,  then 
S'  =  (S  U  {(ab)^~b  |  a  b  G  *4.2})  \  {(a&)^“ a  \  a  7^  b  G  A2}  is  a  2- 
identifying  code  for  B(d,  n).  In  both  these  cases,  the  2-identifying  code  is  of 
optimal  size  dn~1(d  —  1). 

Proof.  Consider  an  arbitrary  string  x  =  x  1X2X3  . . .  xn  G  Alf,  and  define  the 
set  T  =  IDs(x).  We’ll  consider  the  contents  of  T  in  four  cases  based  on  the 
equality  of  x\,xn_i  and  of  x2,xn.  First  let  C  =  {ax  |  a  G  ^4\  {xn_2}}. 

Case  1.  If  X2  =  xn  and  X\  =  r„_i,  then  T  =  A®  C.  Thus  \T\  =  d2  —  d. 

Case  2.  If  X2  7^  xn  and  x\  =  xn-i,  then  T  =  (A  ©  C)  U  {x}.  If 
x  G  A  ©  C  then  x+  =  ax  for  some  a  G  A  \  {xn-2}.  In  this  case, we  have 
x2x3  ■  ■  ■  xn  =  ax  1X2  ■  ■  ■  xn_2.  This  implies  that  we  have  xi  =  x3  =  x5  =  •  •  • , 
and  also  that  x2  =  x4  =  x6  =  •  •  • .  Since  this  case  requires  that  X\  =  xn_i, 
we  must  have  that  either  n  is  even  or  that  X\  —  x2  —  x3  —  ■  ■  ■  —  xn.  In 
either  case,  this  contradicts  our  assumption  that  x2  ^  xn.  Thus  x  A  ©  C, 
and  we  conclude  that  |T|  =  d2  —  d  +  1. 

Case  3.  If  x2  =  xn  and  x\  xn_i,  then  T  =  A  ©  {C  U  {x-}}.  If 
x_  =  ax  for  some  a  G  ^4\{xn_2},  then  ax ix2  •  •  •  xn_2  =  XiX2  •  •  •  xn_i.  This 
implies  that  we  have  x\  —  x2  =  x3  —  ■  ■  ■  —  xn_2  =  xn_i.  This  contradicts 
our  assumption  that  Xi  7^  xn_i-  Thus  x~  7^  ax  for  any  a  G  A  \  {xn_2}, 
and  we  conclude  that  \T\  =  df. 


Case  4.  If  x2  7^  xn  and  X\  7^  xn_ \ ,  then  T  =  (A  ©  {C  U  {a;-}})  U  {x}. 
As  in  Case  3,  since  X\  7^  xn_i,  .A©  {C  U  {a:-}}  contains  d 2  distinct  elements. 
Let  us  consider  whether  x  G  A  ©  {C  U  {a;-}}.  If  not,  then  \T\  —  d2  +  1. 
There  are  two  cases  to  consider. 

a.  If  x  G  .A  ©  C,  then  x+  =  x~ .  In  this  case,  we  must  have  that 
X2X3X4  ■  ■  ■  xn  —  x\x2  ■  ■  ■  xn- 1,  which  implies  that  we  have  the  following  chain 
of  equalities:  aq  =  x2  —  x3  —  ■  ■  ■  —  xn_  \  =  xn.  This  contradicts  the 
assumptions  that  x2  7^  xn  and  aq  7^  xn_i .  Thus,  this  case  does  not  occur. 

b.  If  x  e  A  ©  {a;-},  then  x+  =  ax  for  some  a  &  A.  Then 
x2x3---xn  =  aaqaq  •  •  •  xn_2-  This  implies  that  aq  —  x3  —  x5  —  ■  ■  • ,  and 
also  that  aq  =  aq  =  xe  =  ■  ■  ■ .  If  n  is  even,  this  contradicts  our  assumptions 
that  aq  7^  xn  and  aq  7^  xn-i-  Thus  for  even  n,  this  case  does  not  occur.  For 
n  odd,  this  case  only  occurs  if  x  G  {(ab)^~ a}. 

77.  —  1  _ 

Thus,  if  n  is  even,  or  n  is  odd  and  x  a},  we  can  see  that 

T  =  \Ds(x)  completely  determines  the  string  x.  In  particular,  given  T  we  can 
decide  which  case  we  are  in  based  on  \T\.  We  can  then  determine  aq, . . . ,  xn 
based  on  the  content  of  T.  Thus  in  these  cases  S  is  an  identifying  code. 

However,  if  n  is  odd,  and  x  G  {(ab)^~ a}  we  must  change  S  to  get  an 
identifying  code.  Note  that  a)U{(ah)IL2_6}  =  B2  {{ab)~^~b) .  Since 

our  set  S  contains  vertices  of  the  form  ( ab)~2~ a  but  not  (ab)~2~b,  these  two 
types  of  vertices  must  have  identical  identifying  sets  with  respect  to  S.  Thus 
by  adding  the  vertices  in  {{ab)~2~b},  we  are  able  to  create  distinct  identifying 
sets  with  respect  to  S  U  {(ab)^~ b}.  However,  we  note  that  we  now  have 
the  vertices  of  {(ab) ~^~b}  and  {( ab )IV“ a}  in  our  identifying  code,  but  that 
B2((ab)^~  a)  U  {b(ba)R 2“}  =  H^"(6(6a)^“).  This  implies  that  the  inclusion 
of  both  (ab)^~b  and  ( ab )2V“ a  in  our  identifying  code  is  only  necessary  if  they 
are  required  to  identify  vertex  (ab)^r a  from  vertex  b(ba)~ a-.  So,  as  long  as 
we  can  identify  (ab)^~ a  differently  from  b(ba)^r  without  using  b{ba)~^~ ,  we 
need  only  include  (ab)^~b  and  not  (ab)^~ a  in  our  identifying  code.  Since 
these  two  vertices  have  disjoint  in-balls  of  radius  2  for  n  >  3,  they  must  have 
distinct  2-identifying  sets.  Thus  S'  is  a  2-identifying  code  in  this  case.  □ 

3.1  Non-Existence  Results 

While  we  have  just  seen  1-  and  2-identifying  codes  for  many  de  Bruijn  graphs, 
not  all  de  Bruijn  graphs  have  ^-identifying  codes  for  all  t.  For  example,  there 
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are  no  4-identifying  codes  for  13(2, 5),  B(3, 5),  and  13(3, 6),  among  others. 
The  problem  of  existence  and  non-existence  of  t-identifying  codes  in  B(d,n) 
is  explored  in  this  subsection. 

Definition  3.12.  Two  vertices  u,v  G  V(G )  are  called  t-twins  whenever 
Bf(u)  =  Bf(v).  If  the  graph  has  no  t-twins,  then  G  is  called  t- twin- free. 

Theorem  3.13.  [  ]  For  a  given  graph  G  and  integer  t,  G  has  a  t-identifying 
code  if  and  only  if  it  is  t-twin-free. 

From  this  theorem,  we  can  immediately  determine  an  upper  bound  on  t 
for  B(d,n )  to  be  t- identifiable. 

Corollary  3.14.  The  graph  B(d,n )  does  not  have  a  t-identifying  code  for 
any  t  >  n. 

Proof.  For  t  >  n,  we  note  that  Bf(x)  =  V(B(d,n )),  and  so  all  vertices  in 
our  graph  are  twins.  □ 

The  following  theorem  illustrates  another  difficulty  in  attempting  to  use 
identifying  codes  of  larger  radius. 

Theorem  3.15.  When  n  is  even,  there  is  no  (n  —  1) -identifying  code  for 
B(d,  n). 

Proof.  Consider  vertices  x  =  (01)^01  and  y  =  (01)  ^00  in  B(d,n).  Both 
of  £>r7_i (x)  and  Bf_±(y)  consist  of  all  strings  that  end  in  either  0  or  01. 
Thus  these  vertices  are  (n  —  l)-twins  and  B(d,n )  has  no  (n  —  l)-identifying 
code.  □ 

TL  —  1 

Note  that  this  only  works  for  n  even.  Suppose  n  is  odd  and  x  =  (01)-2-0 

u j 

and  y  =  (01) “2“  1.  Since  y  G  B~_x(y)  but  x  B~_x  (y),  these  vertices  can  be 
distinguished  by  including  y  in  S. 

Theorem  3.16.  InB(d,n),  if  x  andy  aret-twins,  they  are  also  (t  +  1) -twins. 

Proof.  Suppose  that  z  G  Bf+1(x).  Then  we  know  that  either  z  G  Bf(x ) 
or  z  G  Bf+l(x)  \  Bf(x).  If  z  G  Bf(x),  then  since  x  and  y  are  t- twins, 
z  G  Bf(y )  z  G  Bf+1(y).  If  z  G  Bf+1(x)  \  Bf(x),  d(z,x )  =  t  +  1  so  there 
exists  w  G  Bf(x )  with  an  arc  from  z  to  w.  But  again,  since  x  and  y  are 
t- twins,  w  G  Bf(y )  z  G  Bf+1(y).  Thus  Bf+1(x)  C  Bf+l(y).  The  same 

argument  shows  that  Bf+l(y)  C  Bf+1(x).  Hence  we  have  equality,  so  x  and 
y  are  (t  +  l)-twins  in  B(d,  n).  □ 
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Corollary  3.17.  If  there  is  no  t-identifying  code  in  B(d,n),  then  there  is  no 
(t  +  1) -identifying  code  in  B(d,n). 

Theorem  3.18.  Let  x  and  y  be  t-twins  in  B(d,  n).  Define  k  to  be  the  least 
integer  such  that  (1)  k  >3,  and  (2)  {x,y}  C  A%.  Then  for  all  5  >  k,  x  and 
y  are  t-twins  in  B(S,n). 

Proof.  We  have  two  cases.  We  define  the  notation  Bff{x,d)  =  Bf  (x)  CiA'f. 
Case  1:  k  <8  <d.  Note  that  Af  C  Ad,  so  we  have  the  following. 


Bf(x,8)  =  B~(x)nA n5 

=  Bf(x)n(AfnA^) 
=  (Br(x)nAnd)nA% 
=  Bf  (x,  d)  n  A'f 
=  B~(y,d)nA % 

=  Bf(y,8) 


Thus  x  and  y  are  t-twins  in  B(5,  n). 

Case  2:  k  <  d  <  5.  Let  z  G  Af,  and  define  a  =  d(z,x )  (distance  from  z 
to  x)  and  fd  =  d(z,y).  Without  loss  of  generality,  assume  that  a  >  f3.  Then 
we  must  have  that  zp+iZg+2  ■  ■  ■  zn  e  A'^13 . 

Select  w  G  Af  such  that  wg+iWy+2  ■  ■  ■  wn  =  zp+1zp+2  ■  ■  ■  zn,  and  so  that 
the  following  hold  for  the  first  fd  letters  of  w :  for  each  i  G  \fd\,  we  have 
Wi  G  Ak  \  {iCi ,  2/1 } .  Note  that  this  is  possible  since  k  >  3.  By  the  design  of 
w,  we  have  ensured  that  d(w,x )  =  d(z,x)  =  a  and  d(w,y)  =  d(z,y)  =  fd  in 
the  graph  B(k,n).  Since  x  and  y  are  t-twins  in  B(k,n),  we  know  that  either 
both  a, /d  >  t,  or  both  a,  fd  <  t. 

Returning  to  our  original  definition  of  a  and  fd,  this  implies  that  either 
z  is  in  both  Bf(x,8 )  and  Bf(x,5),  or  z  is  in  neither  t-in-ball.  Since  z  was 
arbitrary,  this  implies  that  x  and  y  are  t-twins  in  B(8,n).  □ 

Corollary  3.19.  If  there  is  no  t-identifying  code  in  B(d,n )  and  d  >  3,  then 
there  is  no  t-identifying  code  in  B(8,n )  for  any  8  >  d. 


We  now  will  prove  a  converse  theorem. 
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Theorem  3.20.  Let  x  and  y  be  t-twins  in  B(d,n),  and  let  ip  :  Ad  Ad- 1 
be  a  surjection.  If  tp(x)  ^  <p(y),  then  <p(x)  and  <p(y)  are  t-twins  in  B(d,n). 

Proof.  We  will  first  prove  that  ip(Bf  (x,d))  —  Bf((p(x),d  —  1).  Once  this  is 
done,  then  we  will  have  the  following. 

B~(ip(x),d-l)  =  <p(Bi(x,d)) 

=  (p(B^(y,d)) 

=  B^((p(y),d-1). 

Hence,  by  proving  the  key  equality,  and  by  assuming  that  <p(x)  ^  <p(y), 
we  will  have  produced  a  pair  of  t-twins  in  B(d  —  1  ,n).  We  now  prove  the 
equality  by  showing  containment  in  both  directions. 

Let  z  G  ip(Bf(x,d))-  Then  there  exists  some  (  G  B(d,n)  such  that 
(  G  Bf(x,d)  and  <p(C)  —  z-  Hence  d((,  x)  —  a  <  t  for  some  nonnegative 
integer  a,  and  so  ((a  +  1  :  n)  =  x(l  :  n  —  a).  Since  these  substrings  are 
equal,  we  also  must  have  p>(((a  + 1  :  n))  —  <p(x(X  '■  n  ~  a))  =  :  n  —  a). 

Thus  d(tp(() ,  tp(x))  <  a  <  t  in  B(d  —  1  ,n),  and  so  z  G  Bf((p(x),d  —  1),  and 
therefore  < p{Bff{x ,  d))  C  Bf(<p(x),  d  —  1). 

For  the  reverse,  let  z  G  Bf{p{x)^  d  —  1),  and  then  we  suppose  that 
d(z,(p(x))  =  b  <  t  for  some  nonnegative  integer  b.  This  then  tells  us  that 
z(b  +  1  :  n)  —  <^(:r)(l  :  n  —  b)  =  <p(x{\  :  n  —  b)),  which  are  over  the  al¬ 
phabet  {0, 1,  2, . . . ,  d  —  2}.  We  construct  a  string  (  G  B(d,n )  such  that 
£(1  :  b )  =  tp(z(  1  :  b))  and  Q(b  +  1  :  n)  =  x(l  :  n  —  b).  From  this  construc¬ 
tion,  it  is  clear  that  d((,  x)  <b<t  and  ip(()  =  z.  Hence  (  G  Bf(x,d)  and 
thus  z  G  ip(Bf(x,  d)).  Therefore  Bf(tp(x),d  —  1)  C  < p(B^(x,d )),  and  so  the 
equality  is  shown.  □ 

Corollary  3.21.  If  there  is  no  t-identifying  code  in  B(d,n )  and  d>  3,  then 
there  is  no  t-identifying  code  for  B(d  —  1,  n) . 

Proof.  If  there  is  no  t-identifying  code  in  B(d,n ),  then  there  must  exist  a 
pair  of  t-twins  in  the  graph  -  call  them  x  and  y.  Using  Lemma  3.20,  to  find  a 
pair  of  t-twins  in  B(d  —  1,  n),  we  need  only  show  that  there  exists  a  surjection 
<p  :  Ad  — >  Ad-i  such  that  <p(x)  ^  p>(y)  .  Since  x  j -  y,  there  must  be  some 
index  i  such  that  Xi  ^  y Since  d—  1  >  2,  we  can  choose  a  surjection  <p  such 
that  (p(xi)  7^  <p(yi),  and  thus  <p(x)  ^  <fi(y)-  Then  (p(x)  and  (p(y)  are  t-twins 
in  B(d  —  1 ,  n),  and  so  there  is  no  t-identifying  code.  □ 
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Combining  Corollaries  3.19  and  3.21,  we  arrive  at  the  following  theorem. 

Theorem  3.22.  Let  n,d  G  Z+  such  that  d>  3.  There  exists  a  t-identifying 
code  in  B(d,  n )  if  and  only  if  there  exists  a  t-identifying  code  in  B(d  +  1,  n ). 
Moreover,  if  a  t-identifying  code  exists  for  B(2,n),  then  a  t-identifying  code 
exists  for  B(d,  n ). 


4  Dominating,  Resolving,  and  Determining 
Sets 

In  this  section  we  examine  other  types  of  vertex  sets  which  identify  or  classify 
vertices  up  to  some  graph  property.  The  properties  used  to  define  these  sets 
are  adjacency,  distance,  and  automorphisms. 

4.1  Dominating  Sets 

Definition  4.1.  A  (directed)  t-dominating  set  is  a  subset  S  C  V(G)  such 
that  for  all  v  G  V(G)  we  have  Bf{y)  fl  S  ^  0.  That  is,  S'  is  a  (directed) 
f-dominating  set  if  every  vertex  in  G  is  within  (directed)  distance  t  of  some 
vertex  in  S. 

Note  that  by  definition  every  identifying  code  is  also  a  dominating  set, 
but  not  conversely. 

Theorem  4.2.  [  ]  For  d  >  2,  n  >  1,  dom(B(d,n))  =  . 

In  [10]  a  construction  of  a  minimum  dominating  set  for  B(d,n )  is  given. 
Key  to  this  construction  is  the  integer  x  defined  by: 

_  f  dn~ 2  +  dn~A  + - b  dn~2k  + - b  d2  +  1  mod  dn ,  if  n  is  even; 

1  dn~2  +  dn~ 4  + - b  dn~2k  + - b  d3  +  d  mod  dn ,  if  n  is  odd. 

Let  D  =  {x,x  +  1, . . .  ,x  +  |~^y]  —  1}.  Now  let  S  be  the  set  of  strings  in 

TIf  that  correspond  (base  d)  to  the  integers  in  D.  Then  S  is  a  minimum  size 

dominating  set  for  B(d,n). 

We  also  provide  the  following  lower  bound  for  the  size  of  a  minimum 
f-dominating  set,  and  initial  testing  shows  that  this  is  a  good  estimate  on 
size. 
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Figure  3:  A  directed  resolving  set  in  the  graph  23(2,3)  (black  vertices). 

Theorem  4.3.  Fix  d,  n,  t  G  Z+.  Then  we  have  the  following  lower  bound  on 
the  size  of  a  t-dominating  set. 

domt(B(d,n ))  > 

Proof.  The  graph  B(d,7i)  has  dn  vertices,  and  the  maximum  size  of  a  f-out- 
ball  is  df  +  1.  This  gives  us  a  lower  bound  as  shown.  □ 

4.2  Resolving  Sets 

Definition  4.4.  A  directed  resolving  set  is  a  set  S  such  that  for  all 
u,v  G  V(G )  there  exist  s  G  S'  so  that  d(s,u )  ^  d(s,v).  The  directed 
metric  dimension  is  the  minimum  size  of  a  directed  resolving  set.  An 
example  of  a  directed  resolving  set  in  23(2,  3)  is  given  in  Figure  3. 

Note  that  this  definition  is  not  quite  the  same  as  that  given  in  [6]  (which 
requires  that  there  exist  s  G  S'  so  that  d(u,s )  ^  d(v,s)).  Our  definition 
corresponds  better  to  the  definitions  of  domination  and  of  identifying  codes 
for  directed  graphs  that  are  used  in  this  paper. 

Theorem  4.5.  The  directed  metric  dimension  for  B(d,n )  is  dn_1((i  —  1). 

Proof.  The  following  shows  that  for  each  w  G  An_1  a  directed  resolving  set 
for  B(d,7i)  must  contain  (at  least)  all  but  one  of  the  vertices  with  prefix  w. 
Suppose  that  w  G  AP~l,  and  i  ^  j  G  A  so  that  neither  of  w  ©  i,w  ©  j  is 
in  our  set  S'.  Note  that  if  x,  y  G  V(B(d,n)),  with  x  j -  y,  then  the  distance 
from  x  to  y  is  completely  determined  by  x~  (and  y+ ).  Since  neither  w  ©  i 
nor  w®j  is  in  S',  and  both  have  the  same  prefix,  d(w(Bi,x)  =  d(w(Bj,x)  for 


dn 

dt  +  l 
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all  x  £  S.  Thus  S  is  not  a  directed  resolving  set.  Thus  for  every  w  £  An~l, 
S  must  contain  (at  least)  all  but  one  of  the  strings  w  ©  j  for  j  £  A.  Thus 
| S' |  >  dn~l(d  —  1).  Since  {to  ©  0  |  w  £  A*1-1}  can  easily  be  shown  to  be  a 
directed  resolving  set,  we  have  the  desired  equality.  □ 

The  combination  of  Theorem  3.8  and  Theorem  4.2  yields: 

Corollary  4.6.  The  directed  metric  dimension  for  B(d,n)  is  equal  to  the 
minimum  size  of  a  t-identifying  code  for  B(d,  n ). 

4.3  Determining  Sets 

In  this  section  we  will  use  a  determining  set  to  help  us  illustrate  the  auto¬ 
morphism  group  of  B(d,  2),  study  the  relationship  between  Aut (B(d,  n  —  1)) 
and  Aut (B(d,n))  and  use  the  result  to  find  the  determining  number  for  each 
B(d,n).  First  let’s  recall  some  definitions. 

Definition  4.7.  An  automorphism  of  a  graph  G  is  a  permutation  7r  of 
the  vertex  set  such  that  for  all  pairs  of  vertices  u,v  £  V{G),  uv  is  an  edge 
between  u  and  v  if  and  only  if  is  an  edge  between  ir(u)  and 

An  automorphism  of  a  directed  graph  G  is  a  permutation  7 r  of  the  vertex 
set  such  that  for  all  pairs  of  vertices  u,v  £  V(G),  uv  is  an  edge  from  u  to 
v  if  and  only  if  ir(u)ir(v)  is  an  edge  from  n{u)  to  7r(r;).  One  automorphism 
in  the  binary  (directed  or  undirected)  de  Bruijn  graph  is  a  map  that  sends 
each  string  to  its  complement. 

Definition  4.8.  [3]  A  determining  set  for  G  is  a  set  S  of  vertices  of  G 
with  the  property  that  the  only  automorphism  that  fixes  S  pointwise  is  the 
trivial  automorphism.  The  determining  number  of  G,  denoted  Det(G)  is 
the  minimum  size  of  a  determining  set  for  G.  See  Figure  4  for  an  example. 

Note  that  an  alternate  definition  for  a  determining  set  is  a  set  S  with  the 
property  that  whenever  f,g£  Aut(G)  so  that  f(s)  =  g(s )  for  all  s  £  S,  then 
f(v )  =  g{y)  for  all  v  £  V{G).  That  is,  every  automorphism  is  completely 
determined  by  its  action  on  a  determining  set. 

Notice  that  since  for  both  directed  resolving  sets  and  for  identifying  codes, 
since  each  vertex  in  a  graph  is  uniquely  identified  by  its  relationship  to  the 
subset  by  properties  preserved  by  automorphisms,  the  subset  it  also  a  de¬ 
termining  set.  Thus  every  directed  resolving  set  and  every  identifying  code 
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Figure  4:  A  minimum  size  determining  set  for  B( 2,3)  (black  vertex). 


is  a  determining  set.  However,  though  domination  is  preserved  by  automor¬ 
phisms,  vertices  are  not  necessarily  uniquely  identifiable  by  their  relationship 
to  a  dominating  set.  Thus  a  dominating  set  is  not  necessarily  a  determin¬ 
ing  set.  However,  the  relationships  above  mean  that  the  size  of  a  minimum 
determining  set  must  be  at  most  the  size  of  a  minimum  identifying  code  or 
the  directed  metric  dimension.  For  de  Bruijn  graphs  we  have  shown  that  the 
latter  numbers  are  rather  large.  Does  this  mean  that  the  determining  num¬ 
ber  is  also  large.  We  will  see  in  Corollary  4.12  that  the  answer  for  directed 
de  Bruijn  graphs  is  a  resounding  ‘No’. 

Lemma  4.9.  S  =  {00, 11,  22,  33, . . . ,  (d  —  l)(d  —  1)}  is  a  determining  set  for 
B{d,  2). 

Proof.  Suppose  that  a  G  Aut(H(d,  2))  fixes  S  pointwise.  That  is,  a{ii)  =  ii 
for  all  i  G  A.  Choose  if  j -  rs  E  V(B(d,  2)).  Then  either  i  f  r  or  j  f  s  (or 
both).  If  i  f  r  then  d(ii,  rs)  =  2  which  is  distinct  from  d(ii,  if)  =  1.  Since  an 
automorphism  of  a  directed  graph  must  preserve  directed  distance,  a  {if  f  rs 
if  i  f  r.  If  j  f  s,  then  d(rs,jj )  =  2  which  is  distinct  from  d(ij,jj )  =  1. 
Thus,  again  using  that  a  preserves  directed  distance,  a(ij)  f  rs  if  j  f  s. 
Thus,  a(if  =  ij  for  all  if  G  V(B(d,  2))  and  therefore  a  is  the  identity  map 
and  S'  is  a  determining  set.  □ 

Note  that  we  are  using  directed  distances  both  from  and  to  elements  of 
the  set  S.  Thus  S  does  not  fit  the  definition  of  a  directed  resolving  set  for 
B(d ,  2)  (by  [6],  this  would  require  that  each  vertex  v  G  V (G)  be  distinguished 
by  it  directed  distance  to  the  vertices  of  the  resolving  set).  However  directed 
distances  both  to  and  from  a  set  can  be  used  in  determining  automorphisms 
of  a  directed  graph. 
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Lemma  4.10.  Aat(B(d,  2))  =  Sym(^4d). 

Proof.  Let  a  G  Sym(^4rf).  Define  on  V(B(d,n ))  by  applying  a  to  each 
vertex  coordinate- wise.  That  is  ipa(ab)  =  cr(a)a(b).  It  is  easy  to  show  that 
ipa  preserves  directed  edges  and  thus  is  an  automorphism.  Further,  distinct 
permutations  in  Sym(^4rf)  produce  distinct  automorphisms  since  they  act 
differently  on  the  vertices  of  the  determining  set  S  defined  above.  Thus  we 
have  an  injection  Sym(^)  <— >•  Aut(£>(d,  2)). 

Since  the  vertices  of  S  are  precisely  the  vertices  with  loops,  every  auto¬ 
morphism  of  B(d,  2),  must  preserve  S  setwise.  This  provides  the  necessary 
injection  from  Aut(£?(d,  2))  c->-  Sym(«4).  Thus,  Aut (B(d,  2))  =  Sym(^4d).  □ 

Note  that  we  can  consider  the  automorphisms  of  B(d,  2)  as  permutations 
of  the  loops,  but  we  can  simultaneously  consider  them  as  permutations  of 
the  symbols  in  the  alphabet  Ad-  It  can  be  useful  to  view  the  automorphisms 
in  these  two  different  ways. 

Note  that  as  shown  in  [1],  B(d,  n)  can  be  built  inductively  from  B(d,n  —  1) 
in  the  following  way.  The  vertex  x\  ...  xn  G  V(B(d,n ))  corresponds  to  the 
directed  edge  from  xi . . .  xn_\  to  x2 ...  xn  in  B(d,  n  —  1).  The  directed  edge 
B(d,7i)  from  x\...xn  — >  X2--.xnxn+i  corresponds  to  the  directed  2-path 
X\  . . .  xn-\  — y  x2  ■  ■  •  xn  — *  £3  . . .  xn+i  in  B(d ,  n  —  1).  That  is,  B(d,  n )  is  the 
directed  line  graph  of  the  directed  graph  B(d,n  —  1).  Thus,  by  [7]  (Chapter 
27,  Section  1.1),  Aut(B(d,  n—  1))  =  Aut (B(d,n))  =  Sym(^4).  In  the  following 
paragraphs  we  see  detail  this  correspondence. 

Suppose  that  ip  G  Aut (B(d,n)).  Since  ip  preserves  directed  edges,  we 
know  that  both  ip(x\ . . .  xn )  =  a\ . . .  an  and  <p(x 2  . . .  xn+i )  =  bi  . . .  bn  if  and 
only  if  a2  —  bi, . . . ,  an  —  bn_  1.  Thus  if  tp(x  1 . . .  xn_ixn )  =  cp  .  . .  an_ian  then 
for  every  b  G  A,  ipfx  1  . . .  xn-±z )  =  a\  . . .  an-\C  for  some  c  G  A.  In  particular, 
this  allow  us  to  define  an  automorphism  ip'  G  Aut (B(d,n  —  1))  correspond¬ 
ing  to  ip  G  Aut (B(d,n)).  Define  ip'  by  ip' (x\ . . .  xn-\)  =  a\ . . .  an_i  where 
ip(x\ . . .  xn )  =  cii . . .  an_  1.  By  the  preceding  discussion,  ip'  is  well-defined.  It 
is  also  clearly  a  bijection  on  vertices  of  B(d,7i  —  1).  Consider  x\  . .  .xn-\ 
and  x2 . . .  xn_ixn,  the  initial  and  terminal  vertices  of  a  directed  edge  in 
B(d,  7i  —  l).  Since  <p  preserves  directed  edges  if  ip(x  1 . . .  xn)  =  ai . . .  an_ian 
then  for  any  z  E  A,  ip(x2  . . .  xnz )  =  a2  . . .  anw  for  some  w  G  A.  By  defini¬ 
tion  of  tp',ip'(x  1, . . .  xn-i )  =  cl\  . . .  an- 1  and  ip'(x 2  . . .  xn)  =  a2  . . .  an.  Thus  ip' 
preserves  the  directed  edge.  Thus  we  get  Aut (B(d,  n))  "—t  Aut {B^dpn  —  1)). 
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In  the  other  direction,  suppose  we  are  given  p'  G  Aut  (B(d,  n  —  1)).  Since 
p'  preserves  directed  edges,  and  directed  edges  of  B(d,n  —  1)  are  precisely 
the  vertices  of  B(d,  n ),  p'  defines  a  map  on  vertices  of  B(d,  n).  That  is,  (with 
some  abuse  of  notation) 


p(x  i ...xn) 


p(x i . . .  xn-i  ->■  x2  . . .  xn) 
p\x  i . . .  xn_i  -+x2...xn) 


=  p\x  1  .  .  .  Xn_i)  ->■  p\x 2  .  .  .  Xn). 

Thus,  given  . . .  xn_i)  =  ai . . .  an_i  then  p'(x 2  ■  ■  ■  xn)  =  02  •  •  •  an  for 

some  an  G  A  and  we  dehne  p{x  1 . . .  xn)  =  ai . . .  an.  Further,  since  p'  pre¬ 
serves  directed  2-paths,  p  preserves  directed  edges.  Thus  we  get 


Aut  (B(d,  n  —  1))  *  Aut(£f(d,  n)). 


Since  the  automorphisms  of  B(d,  2)  are  permutations  of  the  loops,  and  of 
the  symbols  of  A,  by  induction,  so  are  the  automorphisms  of  B(d,n)  for  all 
n.  Thus  we  have  proved  the  following. 

Theorem  4.11.  Aut (B(d,n))  =  Sym(^4(i)  for  all  n  >  2. 

Corollary  4.12.  Det (B(d,n))  = 

Proof.  Let  S'  be  a  minimum  set  of  vertices  in  which  each  letter  of  Ad-i  occurs 
at  least  once.  It  is  easy  to  see  that  |S|  =  Any  permutation  of  Ad 

that  acts  nontrivially  on  any  letter  of  Ad  must  act  non-trivially  on  any  string 
containing  that  letter.  Thus  if  a  G  PtStab(S),  then  0  must  fix  every  letter 
contained  in  any  string  in  S.  Thus  a  fixes  0, 1, . . . ,  d  —  1  and  therefore  also  d. 
We  can  conclude  that  a  is  the  identity  in  both  Sym(«4rf)  and  in  Aut (B(d,n) 
and  therefore  S  is  a  determining  set.  Thus  Det (B(d,n))  <  1]. 

Further  if  |S|  <  |"^A]  then  fewer  than  d  —  1  letters  of  Ad  are  used  in 
strings  in  S.  If  a,  b  G  Ad  are  not  represented  in  S,  then  the  transposition  (a  b ) 
in  Sym(Arf)  is  a  non-trivial  automorphism  of  B(d,n )  that  fixes  S  pointwise. 
Thus  S  is  not  a  determining  set.  □ 

Thus  for  directed  de  Bruijn  graphs,  the  determining  number  and  the 
directed  metric  dimension  can  be  vastly  different  in  size. 
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5  Future  Work 


There  are  several  directions  that  future  work  in  this  research  area  could  take. 
The  first  is  to  continue  the  research  on  identifying  codes  in  de  Bruijn  graphs. 
One  key  result  missing  from  this  paper  is  the  determination  of  the  size  of 
a  minimum  1-identifying  code  in  the  graph  13(2,  n)  for  n  even.  Our  results 
show  that  this  size  is  at  most  2"_1  +  1,  and  at  least  2n_1,  leaving  little  room 
between  the  two  bounds.  Additionally,  constructions  are  still  needed  for  t- 
identifying  codes  in  B(d,n )  for  t  >  3.  As  we  have  shown  that  ^-identifying 
codes  do  not  always  exist  in  directed  de  Bruijn  graphs,  it  would  be  ideal  to 
determine  both  a  formula  for  which  graphs  B(d,n )  are  ^-identifiable  for  a 
given  t,  as  well  as  constructions  when  it  is  known  that  such  an  identifying 
code  exists. 

An  alternative  direction  for  future  research  is  to  consider  these  same 
vertex  subsets  (identifying  codes,  dominating  sets,  resolving  sets,  and  de¬ 
termining  sets)  on  the  undirected  de  Bruijn  graph.  Little  work  has  been 
done  and  even  foundational  results  like  the  size  of  a  minimum  dominating 
set  are  currently  missing.  Basic  Matlab  programs  have  shown  that  many 
more  undirected  de  Bruijn  graphs  are  f-identifiable  than  directed,  and  also 
that  the  minimum  size  of  an  identifying  code  is  much  smaller.  For  example, 
through  brute  force  testing  we  have  determined  that  the  minimum  size  of  a 
1-identifying  code  in  B( 2,  5)  is  12,  whereas  in  the  directed  graph  B( 2,  5)  we 
have  shown  that  the  minimum  size  is  16. 

Finally,  variations  on  the  concept  of  identifying  code  would  be  useful  for 
real-world  applications.  For  example,  one  type  of  variation  known  as  a  k- 
robust  identifying  code  allows  for  up  to  k  sensor  (identifying  code  vertex) 
failures  without  disruption  of  the  identifying  code  properties. 
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